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Abstract: Urban topsoil is the most frequent interface between human society and natural environment. 
The accumulation of heavy metals in the urban topsoil has a direct effect on residents' life and health. The 
geochemical baseline of heavy metals is an objective description of the general level of heavy metals in 
the urban topsoil. Meanwhile, the determination of geochemical baseline is necessary for regional 
environmental management, especially in coal cities prone to heavy metal pollution. Heavy metal pollution 
has become an environmental problem in Fuxin City, China for a long time. To establish the geochemical 
baseline of heavy metals in the topsoil of Fuxin City and to evaluate the ecological risk of the topsoil, we 
collected 75 topsoil samples (0—20 cm) and analyzed the concentrations of Cu, Ni, Zn, Pb, Cr, Cd, Hg and 
As through X-ray fluorescence spectrometry, atomic absorption spectrometry and inductively coupled 
plasma optical emission spectrometry. We determined the geochemical baseline of heavy metals in the 
topsoil of Fuxin City by using iteration removal, box-whisker plot, cumulative frequency curve and 
reference metal normalization; evaluated the contamination risk and ecological risk of the topsoil by using 
the baseline factor index, Nemerow index and Hakanson potential ecological risk index; and identified the 
source category of heavy metals in the topsoil by using a pedigree clustering heatmap. Results showed that 
the geochemical baseline values were 42.86, 89.34, 92.23, 60.55, 145.21, 0.09, 0.08 and 4.17 mg/kg for Cu, 
Ni, Zn, Pb, Cr, Cd, Hg and As, respectively. The results of Nemerow index and Hakanson potential 
ecological risk index indicated that the urban topsoil in the study area was slightly contaminated and 
suffering low potential ecological risk. The main contaminated areas dominated in the middle part and 
northeast part of the study area, especially in the western Haizhou Strip Mine. The result of baseline 
factor index indicated that Hg and Cd were the major pollution elements. Using a pedigree clustering 
heatmap, we divided the sources of these heavy metals into three types: type I for Ni and Cr, largely 
represented the enrichment of heavy metals from natural sources; type II for Cu, Pb, Zn, Cd and As, 
mainly represented the enrichment of heavy metals from anthropogenic sources; and type III for Hg, 
represented the form of both natural and anthropogenic inputs. 
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1 Introduction 


The geochemical baseline, which provides information for both the original geological 
background and the disturbance effect of human activities, has a crucial application in the 
study of regional topsoil pollution and the trend prediction of environmental change (Salminen 
and Tarvainen, 1997; Teng et al., 2003; Teng et al., 2009; Zhang et al., 2018; Lu et al., 2019; 
Zhang et al., 2020). The concept of a generalized geochemical baseline was first pointed out in 
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the 1970s (Tidball and Ebens, 1976). After more than 20 a of development, the study of 
geochemical baseline has received widespread attention, and some consensus has been reached 
on unifying the concept of geochemical baseline, deepening the theoretical connotation and 
identifying their influencing factors (Birke, 1993; Siegel, 1995). At the same time, survey and 
mapping of the regional geochemical baseline were systematically developed (Covelli and 
Fontolan, 1997; Darnley, 1997; Plant et al., 2000; Salminen and Gregorauskien, 2000; 
Reimann et al., 2005). Following the implementation of geochemical mapping projects in 
European countries, China launched a multi-objective regional geochemistry survey in the 
early 1990s and started large-scale research nationwide, laying the foundation of geochemistry 
in various areas (Wei et al., 1991). The result is still in use and has a far-reaching impact. In 
recent years, more attention has been focused on promoting the social application of the laws 
of geochemical baseline, which has been extended to the basin environment, river water 
quality, atmospheric dust, farmland soil, urban topsoil and other landscape areas, and has 
become a crucial part of global change researches (Shi et al., 2012; Wei and Wen, 2012; 
Beiseyeva et al., 2013; Tian et al., 2017). Therefore, with the deepening and popularization of 
geochemical baseline research, new questions arise one after another: Does the law obtained 
from previous large-scale research is applicable to small-scale areas? Is the common method 
applicable for all objects? These problems are still in the exploratory stage and need further 
study. 

As a typical mining city with rich mineral resources, there has been continual mining in 
Fuxin City, China for nearly 50 a. Due to the backward technology and extensive mining 
methods, resource depletion and environmental damage occurred in Fuxin City. Moreover, 
environmental risks caused by frequent mining activities are not totally assessed. Thus, the 
establishment of the geochemical baseline currently cannot only provide certain instructions 
for environmental management, but also set a scale plate for judging the availability of 
industrial development. The topsoil of urban area is the most important interface of material 
and energy interaction, and its geochemical baseline level has the significance for environment. 
Currently, most studies on the topsoil environment in this area have focused on heavy metal 
pollution in the mining area (Li et al., 2007; Zhang et al., 2010; Cong et al., 2017; Zhao et al., 
2017; Lian et al., 2019), however, studies on the geochemical baseline of the urban topsoil are 
limited. Therefore, four common methods, namely iterative removal, box-whisker plot, 
cumulative frequency curve and reference metal normalization were used to determine the 
geochemical baseline of the urban topsoil in Fuxin City. Accordingly, the baseline factor index, 
Nemerow index and Hakanson potential ecological risk index were used to evaluate the 
ecological risk of heavy metals in the urban topsoil, and the source classification of heavy 
metals was briefly analyzed using a pedigree clustering heatmap. The findings of this study 
will improve our understanding of the quality of the topsoil in Fuxin City and assist in 
providing fundamental data support for future environmental protection policies. 


2 Materials and methods 


2.1 Study area 


Fuxin City is located in the northwest of Liaoning Province, China (41°41’—42°56'N; 
121°01'-122°56'E). It is adjacent to the Horgin Sandy Land in the north, Liaohe River in the 
southeast and Nulu'erHu Mountain in the west. The terrain is high in the northwest and low in 
the southeast (Zhao et al., 2017). Fuxin City belongs to the semi-humid and semi-arid 
continental monsoon climate zone. The annual average temperature is 8.6°C and average 
annual precipitation is 565.6 mm. The dominant wind direction is southwest in summer and 
northwest in winter, and average wind speed is 2.8 m/s. The sampling plots in the main urban 
area of Fuxin City include Xihe district, Haizhou district and Taiping district with an area of 
176.0 km? (Fig. 1). 


2.2 Sampling and analysis 
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According to the survey method of the Technical Specification for Soil Environment Monitoring 
(Ministry of Environmental Protection of China, 2004), we collected topsoil samples in the main 
urban area of Fuxin City in August 2018. We laid out sampling points according to land type, 
population density and intensity differences of human activity (Fig. 1), and then picked up one 
sample of topsoil at the 0-20 cm depth and subsequently collected four replicated samples from 
each sampling site within a 5 mx5 m grid. After debris of animal and plant was removed from 
samples, we mixed five soil samples evenly. Next, about 1 kg of the mixed sample was stored in a 
polyethene sealing bag. There were 75 samples in total, with each functional zone having 15 
samples, and these functional zones included traffic, industrial mining, commercial, residential 
and green space areas. 

In laboratory, topsoil samples were naturally air-dried, crushed, sieved using a 2-mm sieve, and 
prepared for pressing slices or acid digestion (a mixture of HNO3, HF and HC103). 
Concentrations of Al, Fe, Ti, Rb, Cu, Ni, Zn, Pb and Cr were measured through X-ray 
fluorescence spectrometry (ZSX Primus II, Tokyo, Japan). Arsenic concentration was measured 
through atomic absorption spectrometry (Hitachi Z2000, Beijing, China), Hg and Cd 
concentrations were determined through inductively coupled plasma optical emission 
spectrometry (ICP-OES, PE, USA). 
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Fig. 1 Sampling sites and distribution in the urban topsoil of Fuxin City 


2.3 Methods 


2.3.1 Geochemical baseline value 


Four methods were used to determine the geochemical baseline value and the arithmetic average 
were considered as the final value. 

The box-whisker plot is an effective method to calculate geochemical baseline value, which is 
drawn from the minimum value, first quartile, median, third quartile and the maximum value. The 
calculation process is fulfilled by successively eliminating outliers outside of one and a half times 
interquartile range. Finally, the median of remaining data was considered as the geochemical 
baseline value. 

The iteration removal adopted double-standard deviation to quantitatively determine the 
geochemical baseline value (Reimann and Filzmoser, 2000). When test data showed normal 
distribution, the upper and lower limits of outliers could be identified by calculating the average, 
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plus or minus double-standard deviation (y7+2s). Outliers greater than (}+2s) and less than 
(x—2s) were successively removed until no outliers remained. Then, the arithmetic average of 


remaining data was considered as the geochemical baseline value. Test data were transformed via 
natural logarithm firstly if they showed the logarithmic normal distribution and the following 
processing is the same as normal distribution. 

A cumulative frequency curve was plotted by using the heavy metal concentration as the 
abscissas and cumulative distribution frequency as the ordinate (Teng et al., 2003; Wei and Wen, 
2012; Fan et al., 2014). In general, there may be one or two inflexion points on the curve, which 
symbolized the lower and upper limits of outliers. To find the position of inflexion points, we 
used the consecutive linear fitting. With the increase of samples of higher concentration involved 
in the fitting, inflection points will be found under linear model of R’>0.95 and P<0.05. The 
baseline value was calculated by all data points below the inflexion point when the curve only had 
one bend. However, if there were two, the shape of curve between two bends is considered a 
critical factor, i.e., data involved in computation should be included the part before the lower 
inflexion point (or before the upper inflexion point) if the middle part of the cumulative 
distribution curve is similar to the forepart (or the following part). The average value of data 
involved in computation was considered as the geochemical baseline value. 

The reference metal normalization method can be expressed as an equation established by the 
correlation between active and inert elements (Teng et al., 2003; Newman and Watling, 2007): 


C = aC, +b, (1) 


where Cm is the concentration of active element (mg/kg); Cy is the concentration of inert 


element (mg/kg); a is the regression coefficient; and b is the constant. As mentioned by Newman 
and Watling (2007), a suitable reference metal element is the basis of this method. Many studies 
have used Al, Fe, Ti, Rb, Li and V as the normalizer (Teng et al., 2003; Fan et al., 2014). In this 
study, Al, Fe, Ti and Rb were selected as the candidates to establish the regression equations, and 
the optimal normalizer was sifted through the most significant correlation coefficient. Based on 
fitting relations, we eliminated the samples that were not in 95% confidence intervals until no 
outliers remained. Finally, the regression parameters a and b were confirmed. By substituting 
average concentration of inert element into the equation, we calculated the geochemical baseline 
value. 


2.3.2 Contamination index 
The baseline factor index that reflects the degree of element contamination was calculated using 
the following equation (Zhu et al., 2016): 
P, =C; / B,, (2) 

where P; is the contamination index of heavy metal i; C; is the concentration of polluted element i 
(mg/kg); and B; is the theoretical baseline value (mg/kg). Grade criteria of contamination are 
proposed as follows: P;<1.20 indicates no contamination; 1.20<P;<1.50, slight contamination; 
1.50<Pi<2.00, moderate contamination; and P; >2.00, heavy contamination. 

The Nemerow index is often used to evaluate the comprehensive contamination of multiple 
heavy metals, with considering the difference in contaminated heavy metals, especially for the top 
one (Nemerow, 1974). The equation of Nemerow index is as follows: 


Fy = ‘ican + P?) / 2, (3) 


where Py is the Nemerow index of n® sample; Pave is the average value of P; index; and Pmax is 
the maximum P; index. Normally, Py can be classified as follows: Py<0.70 indicates safety; 
0.70<Pn<1.00, precaution; 1.00<Py<2.00, slight contamination; 2.00<Py<3.00, moderate 
contamination; and Py >3.00, serious contamination. 

The Hakanson potential ecological risk index is a classic method used for assessing the threat 
posed by heavy metals, and this index simultaneously considers the concentration, ecological 
effect, environmental effect and toxicological influence (Hakanson, 1980). The index is calculated 
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as follows: 
Maye ay a (4) 


where RI is the Hakanson potential ecological risk index; E; is the potential toxicity index of a 
single heavy metal i; and T; is the toxic response factor of the element i. The toxicity coefficient 
values are 5, 5, 1, 5, 2, 30, 40 and 10 for Cu, Ni, Zn, Pb, Cr, Cd, Hg and As, respectively, as 
specified by Hakanson (1980). The RI index is grouped into four grades: RI<150 indicates low 
ecological risk; 150<R/<300, moderate ecological risk; 300<R/<600, considerable ecological risk; 
and R/=600, serious ecological risk. 


2.3.3 Pedigree clustering heatmap 


The pedigree clustering heatmap is a visualized multivariate statistical method, which performs 
hierarchical clustering analysis on the data matrix along the reordered columns and rows, and 
draws the heatmap with a dendrogram. By clustering, the groups with similar or vastly different 
expression values are easily visible (Wilkinson and Friendly, 2009). In the current study, the 
"heatmap with dendrogram" plug-in in OriginLab 2018 software was used to complete the 
analysis the source of heavy metals. Specifically, given a selected color key, the inputted heavy 
metal concentrations are automatically standardized and marked in a specific color. The smaller 
the distance cluster, the higher the similarity between objects. The average Euclidean distance and 
Ward Euclidean distance are employed for measuring the distance of 8 heavy metals and the 
distance of 75 samples, respectively. 


3 Results 


3.1 Concentrations of heavy metals 


The results showed that Ni concentration was normally distributed, while the concentrations of 
other seven heavy metals were logarithmic normally distributed (Table 1). Except for As, the 
mean concentrations of other elements were higher than the background value of Liaoning 
Province. Especially the maximum concentration of Cr reached 1095.4 mg/kg, which was nearly 
20 times the background value. Similarly, the maximum value of Hg was 0.64 mg/kg, 
approximately 17 times the concentration of the background value. The mean concentrations of 
the eight heavy metals divided by the corresponding background value of Liaoning Province 
increased in the order of As<Cd<Zn<Cu<Hg<Cr<Pb<Ni. In addition, the concentrations of heavy 
metals showed strong variations. The coefficient of variations (CVs) of As, Zn, Cr, Hg and Cd 
exceeded 50.0%; in particular, the CVs of Hg and Cd were 93.8% and 132.1%, respectively. The 
high spatial variation in these elements suggested the presence of point source pollution to some 
extent, which might be influenced by human activities, specifically for Cd and Pb concentrations 
in the traffic areas. The concentrations and CVs of Cu, Zn and Hg in the industrial mining areas 
were higher than those in other functional areas. Similarly, the CVs of Hg and Zn in the 
commercial and residential areas were higher. The differences among the heavy metal 
concentrations in various functional areas were mainly affected by the intensity or frequency of 
industrial activities, power generation, industrial waste production, etc. Moreover, the 
concentration of Zn was slightly lower; however, the concentration of Cr was higher in the green 
space areas; this might also be attributed to human activities, such as industrial wastewater. 


3.2 Geochemical baseline values 


The box-whisker plot of the eight heavy metals in the topsoil of Fuxin City is shown in Figure 2. 
After outliers were repeatedly removed, geochemical baseline values calculated by this method 
for Cu, Ni, Zn, Pb, Cr, Cd, Hg and As were 42.34, 92.33, 86.77, 60.80, 141.63, 0.08, 0.07 and 
4.95 mg/kg, respectively (Fig. 2). Using the iterative removal method, the geochemical baseline 
values for Cu, Ni, Zn, Pb, Cr, Cd, Hg and As were 42.08, 87.35, 88.82, 63.56, 137.69, 0.08, 0.07 
and 3.58 mg/kg, respectively. 


Table 1 Concentrations of heavy metals in the urban topsoil of Fuxin City 
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Functional item Cu Ni Zn Pb Cr Cd Hg As 
area (mg/kg) (mg/kg (mg/kg) (mg/kg) (mg/kg) (mg/kg) (mg/kg) (mg/kg) 
Minimum 20.00 38.50 40.17 27.85 75.26 0.02 0.03 0.73 
iire Maximum 156.58 207.45 510.16 168.95 1095.42 1.18 0.64 8.40 
og Average 45.13 91.84 107.96 63.15 162.50 0.13 0.11 4.36 
CV (%) 36.54 36.23 61.51 32.31 73.01 132.05 93.08 53.67 
Minimum 29.56 51.08 41.78 34.35 98.53 0.05 0.03 1.32 
Traffic Maximum 72.70 146.16 279.58 168.95 239.47 1.18 0.44 8.40 
area Average 48.84 95.03 117.18 69.01 148.47 0.28 0.12 5.33 
CV (%) 20.60 28.73 49.37 44.97 29.32 120.72 82.48 38.84 
Minimum 26.36 44.79 49.01 39.92 104.00 0.03 0.04 0.94 
Industrial Maximum 156.58 186.23 334.21 112.33 276.42 0.21 0.64 8.11 
maS Average 50.22 91.31 120.67 60.96 146.69 0.10 0.16 4.65 
CV (%) 58.45 42.36 53.62 30.55 28.95 45.96 92.00 46.96 
Minimum 32.75 48.72 75.52 27.85 75.26 0.03 0.06 1.00 
Commercial Maximum 73.50 121.01 200.05 128.11 197.05 0.18 0.57 8.09 
area Average 47.24 76.01 110.82 62.63 152.08 0.11 0.13 4.83 
CV (%) 27.03 31.90 29.74 33.14 25.02 31.74 94.97 46.60 
Minimum 27.16 51.86 44.56 43.63 87.58 0.03 0.04 0.73 
Residential Maximum 57.52 136.73 510.16 77.98 186.79 0.15 0.24 8.28 
area Average 39.20 95.24 103.77 62.32 127.95 0.07 0.09 4.10 
CV (%) 17.93 28.35 106.10 13.79 25.96 50.56 53.56 64.62 
Minimum 20.00 38.50 40.17 31.56 106.74 0.02 0.03 1.38 
Green Maximum 51.93 207.45 141.40 77.05 1095.42 0.21 0.16 6.22 
SPA? Average 40.16 101.63 87.36 60.84 237.28 0.09 0.06 2.88 
CV (%) 22.30 39.48 28.35 24.45 100.42 64.24 48.63 57.24 

Distribution pattern L N L L L L L L 
Background value of 19.80 25.60 63.50 21.40 57.90 0.11 0.04 8.80 


Liaoning Province? 


Note: *, CNEMC (China National Environment Monitoring Centre, 1990); CV, coefficient of variation; N, normal distribution; L, 
logarithmic normal distribution. 
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Fig. 2 Geochemical baseline values calculated by the box-whisker method 


The cumulative frequency curves for the eight heavy metals showed that there was a single 
inflection point in the curves for Cu, Hg and As (Fig. 3). Among them, sample points participating 
in the calculation of geochemical baseline values for Cu and As accounted for 87.5% and 85.0%, 
respectively, whereas Hg accounted only for 44.4%, suggesting that the accumulation of Cu and 
As in the topsoil had a consistent changing trend, whereas the contamination of Hg was abnormal. 
The baseline values of Cu, Hg, and As were 42.08, 0.06 and 3.87 mg/kg, which were calculated 
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by all the sample points below the inflection point. In addition, two inflection points occurred in 
the cumulative frequency curves of Ni, Zn, Pb, Cr and Cd. Moreover, the curve shape between the 
two inflections was similar to that after the upper outlier. Therefore, the baseline values of these 
five elements were calculated as 84.62, 87.87, 58.73, 140.01 and 0.09 mg/kg using the data before 
the upper outlier. Finally, the geochemical baseline values of Cu, Ni, Zn, Pb, Cr, Cd, Hg and As 
obtained by this method were 42.08, 84.62, 87.87, 58.73, 140.01, 0.09, 0.06 and 3.87 mg/kg, 
respectively. 
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Fig.3 Geochemical baseline values calculated by the cumulative frequency curve 


In the processing of reference metal normalization, the correlation between the active and inert 
elements was first analyzed and correlation coefficients are shown in Table 2. Using the 
significance of the correlation as criteria (P<0.01) for selection, we identified the optimal 
normalizers as Ti-Cu, Fe-Ni, Al-Zn, Ti-Pb, Fe-Cr, Al-Cd, Al-Hg and Ti-As (Table 2). According 
to Equation 1, we performed linear regressions for the eight active elements on reference 
elements. After iteratively eliminating the samples lying outside 95% confidence intervals, we 
established new linear equations for predicting the geochemical baseline value (Table 2). Finally, 
by substituting the average concentration of reference element into the new equation, we 
calculated the geochemical baseline values as 44.93, 93.06, 105.47, 59.09, 161.51, 0.11, 0.10 and 
4.25 mg/kg for Cu, Ni, Zn, Pb, Cr, Cd, Hg and As, respectively. 

The average values of the geochemical baseline results obtained from the four methods were 
considered as the final value. Consequently, the final geochemical baseline values for the urban 
topsoil in Fuxin City were 42.86, 89.34, 92.23, 60.55, 145.21, 0.09, 0.08 and 4.17 mg/kg for Cu, 
Ni, Zn, Pb, Cr, Cd, Hg and As, respectively (Table 3). Among the eight heavy metals, 
geochemical baseline values of Cd and As were lower than the background value of Liaoning 
Province; however, the geochemical baseline values for the rest of heavy metals were 
significantly higher. The geochemical baseline values of Ni, Pb and Cr were almost triple the 
background value, whereas those of Cu, Hg and Zn were approximately twice the background 
value, indicating serious heavy metal accumulation in the study area. Meanwhile, a comparison of 
the geochemical baseline value with other resource-oriented cities was performed. We observed 
that the concentrations of Ni, Pb and Cr were higher, but those of Cd and As were lower in Fuxin 
City than those in most of the other cities. In addition, the concentrations of Cu, Zn and Hg were 
similar to those reported in other cities (Table 4). It can be observed that the differences in an 
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urban setting among these resource-oriented cities were relatively obvious and the main reason 
probably has more to do with the economic activity. 


Table 2 Correlation coefficients and regression equations of inert element and active element 


Element Al Fe Ti Rb Regression equation R P 
Cu —0.256" 0.272° ~0.340" —0.200 y= —59.97x+70.54 0.99 0.00 
Ni 0.073 0.314" 0.132 —0.007 y= —27.12x+5.65 0.92 0.00 
Zn —0.444" 0.093 —0.268* —0.284* y= —31.05x+320.33 0.98 0.00 
Pb 0.057 0.079 0.315% 0.042 y= —15.08x+52.60 0.89 0.01 
Cr 0.107 0.566% 0.275* —0.016 y= —52.81x-7.97 0.98 0.00 
Cd -0.457™" 0.016 —0.278* —0.321** y= —0.04x+0.38 0.97 0.00 
Hg —0.306" —0.265* —0.108 —0.038 y= —0.03x+0.29 0.96 0.00 
As —0.179 0.060 ~0.328" —0.148 y= -16.52x+11.28 0.99 0.00 


Note: * and ** indicate significant correlations at P<0.05 and P<0.01 levels, respectively. 


Table 3 Geochemical baseline values computed by various methods in the urban topsoil of Fuxin City 


Cumulative frequency curve Background 
Reference ‘ ; 
i 7 tal Box- Iteration Final value of 
Element First Second Baseline vl s ti Whisker removal value Liaoning 
bend bend value me ae ees (mg/kg) (mg/kg) (mg/kg) Province 
(mg/kg) (mg/kg) (mg/kg) (mg/kg) (mg/kg) 
Cu 59.25 44.93 42.08 44.93 42.34 42.08 42.86 19.80 
Ni 97.50 93.06 84.62 93.06 92.33 87.35 89.34 25.60 
Zn 110.00 105.47 87.87 105.47 86.77 88.82 92.23 63.50 
Pb 50.25 59.09 58.73 59.09 60.80 63.56 60.55 21.40 
Cr 141.00 161.51 140.01 161.51 141.63 137.69 145.21 57.90 
Cd 0.10 0.11 0.09 0.11 0.08 0.08 0.09 0.11 
Hg 0.09 0.10 0.07 0.10 0.07 0.07 0.08 0.04 
As 7.30 4.25 3.88 4.25 4.95 3.58 4.17 8.80 
Table 4 Comparison of geochemical baseline value with other literature 
Sampling site cu Ni Zi re E cd Hg A Reference 
A (mg/kg) (mg/kg) (mg/kg) (mg/kg) (mg/kg) (mg/kg) (mg/kg) (mg/kg) 
Baotou, daner 20.37 17.98 61.42 23.52 118.47 , - 8.65 Zhu et al. (2016) 
Mongolia 
Bayan Obo, Inner 9646 29.16 76.71 29.07 124.37 - - 16.09 Zhu et al. (2016) 
Mongolia 
Dexing, Jiangxi 58.90 - 138.54 54.61 72.87 0.37 0.100 14.00 Liu et al. (2014) 
Urumdi, Xinjiang 94.54 43.28 294.47 53.53 54.28 1.17 - - Wei et al. (2010) 
Zigong, Sichuan 23.10 24.50 62.00 28.20 66.40 0.18 0.037 3.86 Liet al. (2012) 
Chengdu, Sichuan - 32.00 535.00 281.00 110.00 2.94 0.630 28.60 Shi et al. (2012) 
Tongling, Anhui 47.70 25.20 113.00 39.00 73.50 0.33 0.051 24.30 Jia et al. (2019) 
Suzhou, Anhui 16.97 - 48.66 19.43 65.22 - - 9.07 Gao et al. (2018) 
Yanzhou, Shandong 34.34 33.27 89.86 27.39 75.27 0.14 0.069 7.77 Zhang (2007) 
Fuxin, Liaoning 42.86 89.34 92.23 60.55 145.21 0.09 0.080 4.17 This study 


Note: - means no available value. 


3.3 Contamination assessment 


In the present study, P;, Pn, and RI were used to assess the level of contamination and the 
potential risk in the urban topsoil of Fuxin City. The P; values for Cu, Ni, Zn, Pb, Cr, Cd, Hg and 
As were in the ranges of 0.47-3.65, 0.43-2.32, 0.44-5.53, 0.46-2.79, 0.52-7.54, 0.23-13.23, 
0.39-8.31 and 0.18-2.02, respectively, meanwhile, the mean P; decreased in the order of Hg 
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(1.46)>Cd (1.45)>Zn (1.17)>Cr (1.12)>Cu (1.05)>As (1.04)>Pb (1.03)>Ni (1.02) (Fig. 4). 
According to the grade criteria, P; values of 1.20-1.50 indicate that the samples are slightly 
contaminated, whereas P; value of <1.20 suggests a deficiency to minimal enrichment. Thus, 
among the eight heavy metals, Hg and Cd (slightly contaminated) were identified as the major 
pollutant that caused the contamination in the topsoil of Fuxin City, while the contamination of 
other heavy metals was not obvious. Moreover, a portion of P; for Cd, Hg, Cr and Zn exceeded 
2.00. Particularly, the maximum P; was 13.23 for Cd and 8.31 for Hg, indicating some severe 
contamination, probably due to anthropogenic point sources. In addition, the mean P; values for 
each heavy metal from different functional areas were compared. Overall, the P; values for Cd, 
Hg, Cu, Zn, Pb and As, especially for Cd and Hg (heavily contaminated), in the traffic and 
industrial mining areas were higher among the five functional areas, which might have resulted 
from high-intensity human productive activity. By contrast, a relatively low level of P; value of 
each heavy metal in the commercial and residential areas was found. In green space, the 
enrichment of Ni and Cr was higher than those in other functional areas, indicating that the 
pollution sources were different from those of Cd and Hg. 


Whole study area Œ Industrial mining area @ Residential area —- Average P, value 
@ Traffic area Commercial area E Green space 


Cu Ni Zn Pb Cr Cd Hg As 
Heavy metal 


Fig. 4 Pi (contamination index) values for heavy metals in the urban topsoil of Fuxin City 


The Py values for the 75 samples ranged from 0.80 to 9.56, with a mean value of 1.96 (Table 5). 
It should be noted that the mean Py was very close to 2.00, although it was ranked as slight 
contamination. Moreover, some samples that were evaluated as seriously contaminated were 
found in functional area, indicating that the enrichment of heavy metals in the urban topsoil was 
relatively obvious, especially in the traffic area (PN=2.93) and industrial mining area (PN=2.04). 
The RZ values varied between 56.63 and 483.98, with a mean value of 131.54 (Table 5), indicating 
a low potential ecological risk in the topsoil, while 76.0%, 17.3% and 6.7% of samples had low 
potential risk, moderate potential risk and considerable potential risk, respectively. In addition, 
most of the contaminated samples were obviously located in the traffic areas (R/J=186.78) and 
industrial mining areas (R/=148.22), followed by commercial areas. The lowest ecological risk 
was observed in the residential and green space areas. 


Table 5 Pn (Nemerow index) and R7 (Hakanson potential ecological risk index) values for heavy metals in the 
urban topsoil of Fuxin City 
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Functional area Py valus AA 
Range Mean Rating range Range Mean Rating range 
Whole study area 0.80-9.56 1.96 Precaution—seriously 56.63-483.98 131.54 Low-considerable 
Traffic area 1.12-9.56 2.93 Slightly—seriously 77.39-483.98 186.78 Moderately—considerable 

D gee 1.11-6.09 2.04 — Slightly-seriously  95.49-381.92 148.22 Low-considerable 
Commercial area 1.11-5.33 1.66 Slightly—seriously 89.26-324.03 132.03 Low-considerable 
Residential area 1.10-4.07 1,53 Slightly—seriously 63.59-154.11 98.54 Low-—moderately 
Green space area 0.80-5.50 1.64 Precaution—seriously 56.63—159.19 92.12 Low-—moderately 


The spatial distributions of Px and RI were quite similar, with high values in the middle part and 
northeastern part of the study area (Figs. 5 and 6). Specifically, the middle section of Meicheng 
Road (sample 12), located in the west of Haizhou Strip Mine, was the most polluted (Px=9.56 and 
RI=483.98). Relatively lower contamination or potential risk was observed in the eastern section of 
Yulong Road (sample 60), the middle part of Xihe District (sample 54), and the region between the 
Meicheng Road and Bayi Road. In fact, more than 70% of the residents were living here and most 
economic activities, such as traffic emissions, coal combustion, industrial production, thermal 
power generation, etc. were carried out in the area, which could easily lead to the mixed 
accumulation of multiple heavy metals. In addition, a sample in the south of Yulong Lake was 
seriously contaminated (Py=5.50), but has lower potential ecological risk, which was due to the 
anomalous enrichment of Cr by anthropogenic activities. However, the enrichment of Cr was 
characterized by a low toxicity. 


121°3S'E 121°36E 121°37E 121°38E 121°39E 121°40'E 121°41'E 121°47'E 121°43'E 121°44'E 


121°3S'E 121°36E 121°37'E 121°38'E 121°39E 121°40E 121°41I'E 121°42E 121°43'E — 121°44'E 
Fig. 5 Spatial distribution of Pn (Nemerow index) value in the urban topsoil of Fuxin City 
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121°3S'E 121°36E 121°37'E 121°38'E 121°39E 121°40'E 121°41'E 121°42'E 121°43'E  121°44'E 
Fig. 6 Spatial distribution of RJ (Hakanson potential ecological risk index) value in the urban topsoil of Fuxin 
City 


3.4 Pedigree clustering heatmap 


Using the pedigree clustering heatmap, we divided 75 samples into three groups at the Euclidean 
distance of 23.10, i.e., groups S-a, S-b and S-c (Fig. 7). Among them, the samples in the group 
S-a are mainly characterized by high color key values of single or multiple elements. Obviously, 
the maximum values of Cu, Pb, Zn, Cd, Hg and Cr are all in this group, indicating that this group 
contains the abnormal enrichments of heavy metal influenced by human activities. There is no 
significant difference in the color key values between the samples in group S-b. Notably, the color 
key value of As element is generally higher than those of other elements, indicating a nonpoint 
source pollution of As. In group S-c, except for a few samples that enriched in Ni and Hg, most of 
the samples had low color key values, indicating less contamination. 

Meanwhile, we divided the eight heavy metals into three types at the Euclidean distance of 
12.27 (Fig. 7). (1) Type I for Ni and Cr: with the exception of a very few isolated high color keys, 
almost all of the color keys in this group had low values. (2) Type II for Cu, Pb, Zn, Cd and As: A 
correlation was observed among Cu, Pb and Zn with an average distance of 9.68, and these three 
elements formed a group with Cd and As with an average distance of 11.31, suggesting that Cu, 
Pb and Zn had homology with Cd and As. The color key values of this group obviously reflected 
the simultaneous enrichment of multiple heavy metals in some samples, which probably had a 
common source caused by human activities. (3) Type II for Hg: When the average distance was 
12.29, Hg showed a relationship with type II; however, when the average distance was 12.27, Hg 
formed an independent group. This observation indicated that its clustering characteristic was 
different from other elements, and it tended to pair up with type II at some level. 


4 Discussion 


4.1 Comparision of geochemical baseline values obtained from different methods 


The accumulation of heavy metals in the urban topsoil has always attracted much attention, 
especially the type and intensity of heavy metal enrichment caused by human activities. However, 
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Fig. 7 Pedigree clustering heatmap of heavy metals in the urban topsoil of Fuxin City. The color key represents 
a weighted value of each element concentration. A high color key value represents the severe enrichment (such as 
red), whereas a low value represents the low enrichment (such as purple). 


in many cases, due to the limitations of factors such as outdated in timescale or mismatched in 
spatial scale, there are no ideal heavy metal standard reference values for soil environmental 
research in certain specific areas. It is necessary to establish an environmental geochemical 
baseline that is applicable to the area for assessing the quality of urban topsoil and impact of 
humans on the topsoil environment. In the process of establishing a geochemical baseline, the 
results obtained from different methods may be different. For example, both the iteration removal 
method and box-whisker method used in this study eliminate outliers firstly and then calculate the 
baseline value based on the remaining data. However, the former eliminates outliers outside of 
double-standard deviation range and calculates the arithmetic mean of remaining data as the 
baseline value, while the latter eliminates outliers outside one and a half times interquartile range 
and calculates the median of remaining data as the baseline value. Therefore, it can be inferred 
that if the concentration distribution of heavy metals is relatively concentrated, the results of the 
two should be close; however, if the concentration distribution is relatively discrete, then the 
baseline results are very likely to be different. The cumulative frequency curve method and 
reference metal normalization method are more complicated than the above two methods. The 
cumulative frequency curve method focuses more on the identification of any abnormal 
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enrichment. The key link is to determine the inflection point, which is often regarded as the limit 
of different sources of heavy metals. Any sample data with a concentration lower than the 
inflection point will participate in the calculation of geochemical baseline value, including the 
abnormal low concentration values. Therefore, the geochemical baseline value obtained by this 
method may be slightly lower than other methods. As for the reference metal normalization 
method, although this method greatly reduces the impact of granularity, it introduces the reference 
element to participate in the calculation of geochemical baseline value, and the concentration 
distribution of reference element itself influences the geochemical baseline result to a certain 
extent. According to previous reports in the literature and results of this study, we observed that 
the geochemical baseline value obtained by this method is slightly higher than those of 
above-mentioned methods. Of course, the difference between these four methods is also 
influenced by the dispersion of statistical data; the greater the dispersion of data, the greater the 
difference between results. It is not difficult to conclude that each method has its own advantages; 
however, the core processing idea is to eliminate outliers and then locate the lower limit of 
outliers. Therefore, in actual research, a variety of baseline determination methods should be used 
simultaneously to greatly improve the accuracy and usability of the geochemical baseline value. 


4.2 Indicative significance of the lower limit of abnormal values 


In geochemistry research, it is a conventional method to distinguish the lower limit of abnormal 
values and calculate the geochemical baseline or background value accordingly to identify the 
heavy metal sources. However, using the geochemical baseline value of topsoil samples as the 
background value and considering the lower limit of abnormal values as the boundary between 
natural and anthropogenic sources are common phenomena. Although mathematical methods for 
determining geochemical baseline and background values are almost the same, the meaning of the 
lower limit of abnormal values is quite different. The geochemical baseline of topsoil represents 
the ordinary concentration level of heavy metals in the topsoil, and its most important function is 
to find a heavy metal concentrated amount in the topsoil at the sampling time, which can be used 
as a reference standard. The "ordinary concentration level" here is premised on eliminating 
abnormal enrichment (an important step in various geochemical baseline determination methods), 
in which the excluded abnormal values are considered to be caused by anthropogenic disturbance 
(Teng et al., 2003). Whereas, the remaining data involved in the geochemical baseline calculation 
represents the "natural concentration" almost without human effect; however, may also contain a 
degree of human influence (Zhang et al., 2006). Actually, in a stable deposition layer or soil 
horizon, the probability of bottom samples being disturbed by human activities is low as 
compared with the top samples, therefore, the lower limit of abnormal values is more reliable as 
the boundary between natural and anthropogenic sources (Rattenbury et al., 2018). If the 
existence of significant difference in the concentration of topsoil is not certain, the lower limit of 
abnormal values only indicates the boundary between global enrichment and regional enrichment. 
Therefore, it is difficult to determine the concentration that is maintained without human 
disturbance. Precisely, if there is nonpoint source contamination caused by human activities, this 
part of concentration, which may be indistinguishable from natural environment and could not 
reveal in the form of outliers, is extremely likely to be considered as the concentration in natural 
environment. Many topsoil geochemical baseline studies for different objects, especially for the 
resource-based cities, have generally observed that the geochemical baseline values of various 
heavy metals were significantly higher than those of earlier ones (Fan et al., 2014; Li et al., 2016; 
Zhou et al., 2019). A similar widespread accumulation of multiple elements has also been 
observed in this study, and the increase in concentration probably included the indistinguishable 
part, as mentioned above. Accordingly, not all the lower limits of abnormal values always 
represented the boundary between natural and anthropogenic sources; in particular, the topsoil 
was chosen as the experimental material. 


4.3 Identification of the source of heavy metal contamination 


Considering that the object selected in this study is topsoil, it is not suitable to directly distinguish 
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the source of heavy metals based on its geochemical baseline value. Therefore, a pedigree 
clustering heatmap is used to classify and identify the main source types of heavy metals in the 
topsoil. Among the eight heavy metals involved in the cluster analysis, Ni and Cr are divided into 
a group. The few high-concentration samples of the two are mostly distributed in the green spaces 
and factory courtyards on the fringe of city. Among them, the sample with the maximum 
concentration of Cr occurs in the green space near the Municipal Infectious Disease Hospital on 
the south side of Yulong Lake, and it may be contaminated by a certain chromium-containing 
medical material. The soil composition of the other sites, according to the survey, is a mixture of 
coal ash backfill and excavation slag. Except for the above-mentioned samples that are obviously 
affected by human activities, the concentration values of the vast majority of other samples have 
little variation; more importantly, most samples with slightly higher concentrations are evenly 
distributed in the fringes of the main urban area, especially the underdeveloped areas at the north 
and south ends of the urban area with low population density and less human disturbance. This 
shows that the main enrichment modes of Ni and Cr are not controlled by social production 
activities but by regional geological conditions and soil parent materials. Moreover, Ni and Cr 
belong to the iron group elements and often present a high correlation in the soil, especially under 
strong adsorption of natural formed soil colloids that are easier to be enriched (Lii et al., 2018; 
Zhang et al., 2020). Therefore, the enrichment sources of this group of heavy metals are mainly 
natural sources (mixed with a very small number of point source enrichment caused by human 
activities). 

Cu, Pb, Zn, Cd and As are classified in the same group, indicating that the accumulation of 
these heavy metals has a certain similarity in the source. Considering the results of sample 
clustering, heavy metal pollution evaluation and potential ecological risk assessment, it can be 
found that some samples are simultaneously enriched in some of these five elements, and what's 
more, these samples are mainly concentrated in the central and northeastern parts of the main 
urban area where heavy metal pollution and potential ecological risks are the most serious 
(Figs.5—7), indicating that the enrichment of these elements is very likely to be homologous. 
These two regions not only gather most of population of Fuxin City, but also gather a variety of 
social and economic occurrence areas such as industry and mining, commerce, manufacturing and 
transportation. This also indicates that the enrichment of above-mentioned heavy metals in these 
areas mainly originates from anthropogenic activities. More specifically, economic activities in 
the study area, such as coal washing, coal preparation, thermal power generation, chemical 
manufacturing, leather processing and mining transportation related to Haizhou Strip Mine, 
mechanical processing, stone and building materials processing in the northeastern part of the city, 
can provide sources for the enrichment of Cu, Pb, Zn and Cd. And the old mining area in southern 
Xi River has caused As enrichment in a large range. In addition, planting, animal husbandry, 
garbage incineration and discharge of wastes have also been confirmed to cause the accumulation 
of heavy metals (Li et al., 2010; Chen et al., 2012; Wang et al., 2016; Xue et al., 2018; Han et al., 
2019; Wang et al., 2019). 

The enrichment of Hg is different from other elements. Most Hg-contaminated samples are 
distributed along the prevailing wind direction from southwest to northeast in the study area. 
Generally, the accumulation of Hg is often closely related to coal and metal ore resources (Jiang 
et al., 2005; Cai et al., 2012). Moreover, Hg is a liquid element with volatility; when it is in a 
volatile state, its enrichment direction will be consistent with the prevailing wind direction (Lii et 
al., 2018). It is speculated that Hg enrichment in the study area is controlled by natural conditions 
to a certain extent. At the same time, however, the concentration of Hg in the sample is highly 
variable, and the contamination of some samples is serious, reflecting that enrichment of Hg has 
been significantly affected by humans. One obvious reason is that Haizhou Strip Mine was once 
the largest strip mine in Asia. It has a long history of using coal-fired power generation and 
heating resources, which may provide an important source of Hg enrichment. Moreover, Hg 
enrichment caused by such man-made activities may be relatively more. Therefore, it can be 
inferred that the source of Hg enrichment is a mixed input of anthropogenic and natural sources. 
In addition, the contamination degree of heavy metals in the topsoil of different functional areas 
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in Fuxin City is relatively close, which is obviously different from some non-resource-based cities. 
This phenomenon precisely reflects the development characteristics of Fuxin City built on the 
coal. The economic activities and urban expansion of the city are almost entirely developed 
around mining areas. As a result, the functional areas are nested with each other and have little 
difference in the contamination of heavy metals. 


5 Conclusions 


The geochemical baseline of the urban topsoil is one of the important indices for representing the 
environmental quality of topsoil. As a typical coal city, Fuxin City is in the critical period of 
economic transformation, and its topsoil environmental quality is one of the reference standards 
to evaluate whether the transformation practice meets environmental friendliness. Our results not 
only analyze the current geochemical baseline of heavy metals in the urban topsoil of Fuxin City 
but also summarize the accumulation trend of heavy metals in the topsoil, which can be divided 
into anthropogenic sources, natural sources and a mixed source. At the same time, the 
contamination risk and ecological risk caused by the accumulation of heavy metals in the topsoil 
was also evaluated, and the characteristics of the relatively low risk of heavy metal contamination 
in the overall topsoil in the study area were identified, along with spatial contamination 
characteristics dominated by the middle part and northeastern part of the study area, especially the 
west of Haizhou Strip Mine. 
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